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ABSTRACT 

A programing language is described which is applicable to 
problems conveniently described by transformation rules. By this 
we mean that patterns way be prescribed, each being associated with 
a skeleton, so that a series of such pairs may be searched until a 
pattern is found which matches an expression to be transformed- The 
conditions for a match are governed by a code which also allows sub- 
expressions to be identified and eventually substituted into the cor- 
responding skeleton. The primitive patterns and primitive skeletons 
are described, as well as the principles which allow their elaboration 
into more complicated patterns and skeletons. The advantages of the 
language are that it allows one to apply transformation rules to lists 
and arrays as easily as strings, that both patterns and skeletons may 
be defined recursively, and that as a consequence programs my be stated 
quite concisely. 



/ 



INTRODUCTION 

LISP (1) Is a concise language for describing calculations which are 
primarily concerned with set theory, especially those which may be defined 
inductively. However it often requires a complicated expression with a 
bewildering hierarchy of parentheses to state other kinds of problems, 
COMIT (2) is better adapted to calculations involving transformations of 
form because one may indicate that if he sees an expression of a certain 
type he is to replace it by another. The practical disadvantage of COMIT 
has been the complicated way It handles lists. To isolate a sublist 
requires either advance preparation or a program to seek out a string with 
balanced parentheses. One suspects there would be an advantage to 
corabjning the salient features of both LISP and COMIT. 

The conversion process which we describe was inspired by a desire 
to use a transforation language such as COMIT to describe programs for 
a variety of applications including algebraic simplification and compiler 
construction. The expressions or patterns to be recognized are constructed 
from more fundamental patterns in several ways including the system 
of parenthetical groupings used to form lists. Their analysis may be 
described by LISP, permitting one to concentrate upon the form of the 
patterns and not upon the means of their generation according to the rules 
of list processing. 

The language CONVERT has been defined through a series of LISP 
functions and it* implementation centers around the two functions RESEMBLE 
and REPLACE. The first, (RESEMBLE X L E) is a pattern recognition 
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function which distinguishes some twenty kinds of fundamental patterns 
and vhich analyzes more complex patterns built up out of those fundamental 
patterns. The second, {REPLACE D S) , is used to construct new expressions. 
These may contain portions of the original expression which have been 
recognized and identified by RESEMBLE. In addition, REPLACE permits 
the fo mat ion of a new expression by several stages because partial 
results may be formed and analyzed further. The rules for the analysis 
are applied by the control function (CONVERT HIE R) . Together with 
Its satellites it constitutes the definition of the language CONVERT. 

As indicated by writing it in the form (CONVERT M IE R), CONVERT 
is a function of four arguments. The third of these, E, ts the 
expression which is to be transformed by the given rules. There may 
be one or more sets of rules all of vhich together comprise the 
argument R, A rule itself is a pair, (P S), consisting of a pattern P 
and a skeleton S. These constituents are generally composite, having 
been generated from primitive constituents which nay either have a fixed 
meaning or be prescribed for an individual program. Whether a pattern 
or skeleton is variable or constant, it may be used in two ways 
depending upon whether it represents Just one element of an expression 
in which it occurs, or whether it could represent a scries of several 
elements. In the former case it is referred to as an expression, but in 
the latter as a fragment. 

The use of patterns is to determine the portions of E which are to 
be Identified and later used In constructing the converted expression, 
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as well as to ensure that E has a determined form. The variables which 
are thereby required are introduced into each program by means of the 
arguments H and I of CONVERT, These two arguments also serve to allow 
Che definition of complex patterns and skeletons for use throughout the 
rule sets, as veil as to Introduce the variable skeletons. 

The argument R of CONVERT is a collection of rule sets, each of which 
Is paired with an identifying name. In operation, the first rule set is 
chosen, and the pattern of its first rule is compared against the argument 
E by means of the function RESEMBLE. In doing so it makes use of a 
dictionary forned from the arguments M and I which indicate the variables. 
Should a match occur, the function REPLACE Is then used to make appropriate 
substitutions into the paired skeleton S. Should the match fail, the 
second rule is investigated, and so on. The search terminates when either 
a match is found and the corresponding skeletal substitution made or 
else when the rule set is exhausted and the expression is left unchanged. 
The other rule sets, aside from the first, nmy be invoked at a later time 
because of the provision for forming intermediate results, which may 
be further analyzed, either by reapplying the original rule set 
introducing a new one, or calling on another of the sets comprising R. 

In the remainder of the paper we shall discuss the details of this 
process much more minutely, indicate some typical applications, and relate 
our experiences in using the language. Particularly, we need to 
catalogue primitive patterns as well as primitive skeletons, and to outline 
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thc flow of control- However we shall first conclude our Introduction 

by giving a glossary of technical terms, Che usage of soot of vhlch Is 

current In the description of list and string processors, but whose 

usage in the language CONVERT we wish to make definite. 

The following terns are rather general and are concerned with the 

distinctions between the various parts of a list or string. 

CHARACTER One of the recognizable symbols froti which programs 
are constructed . 

ALPHABET The collection of admlssable characters. For present 
purposes it is the set of 48 characters available on 
the IBM 026 keypunch, including the blonk, 

STRING A linearly ordered sequence of characters. 

ATOM A string which contains none of the characters blank, 

left parenthesis, nor right parenthesis. 

DELIMITER One of the three characters blank, left parenthesis 
or right parenthesis. 

LIST A string which commences with a left parenthesis, 

terminates with a right parenthesis, and otherwise 
consists of strings, separated by one or more blanks, 
which are themselves either atoms or lists. 

ELEMENT One of those quantities which forms a list; it oay be 

either an aton, or a list, but is used with reference to 
the list of which it is a member* refers only to a single 
constituent, docs not include one of the delimiters, and 
does not mean one of the elements of a sublist. 

EXPRESSION Either a list or an atom. Unlike the custom in LISP 1.5, 
the empty list is not regarded as an atom because it is 
always represented by a pair of parentheses, (), and not 
by a special atora, such as NIL. An elenent is an 
expression but moreover has to be thought of as part of 
a list and not as an isolated entity. 

FRAGMENT A consecutive sequence of elements belonging to an 
original list, but not enclosed in parentheses. 
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While the preceding definitions are generally applicable to the 
LISP language as veil as to CONVERT, the following definitions derive 
their meaning from their intended use in describing the function CONVERT, 
To be further noted is the fact that some terms have a specific further 
meaning when applied to a pattern or skeleton, 

PATTERN An expression P, intended to be compared to the argument 
E of CONVERT, in order to discover whether E has a 
certain form, as well as to identify selected portions 
of E. 

VARIABLE A pattern or skeleton represented by a single atom and 

which represents a single subexpression of the expression 
in which it occurs. 

FRAGMENT A pattern or skeleton represented by a single atom, 
but which represents a fragment of the expression in 
which it occurs. 

MODE The specification which accompanies each variable or 

fragment which governs the circumstances under which It 
may match an expression, if it is a pattern, or under 
which substitution Is made if it is a skeleton. 



VALUE 



A parameter accompanying each variable or fragment. 



SKELETON An expression Intended to guide the substitutions 

made by REPLACE, In general skeletons are formed as 
lists from primitive skeletons, but it is also possible 
for skeletons to represent functions together with their 
arguments written in prefix form* 

RULE A pair consisting of a pattern and a skeleton, Khen 

CONVERT compares the pattern to its argument E and a 
resemblance is found, the values of relevant variables 
and fragments are substituted into the skeleton, 

DICTIONARY A list used to establish a correspondence between 
different quantities. 
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THE PATTERN RECOGNITION FUNCTION (RESEMBLE X L E) 

The pattern recognition function RESEMBLE is responsible in each 
case for determining whether the pattern half of a rule patches the 
expression E which it is desired to transform. Every pattern is 
constructed from primitive patterns so that it suffices to describe 
the primitive patterns and to call attention to the rules of list 
formation which say that a sequence of lists or atons nay be separated 
by blanks and enclosed within parentheses to form a new list. This 
process may be repeated to tons lists of arbitrary complexity. 

Actually the principle of list formation is only one of three 
techniques which may be used to construct compound patterns. To 
recognize this distinction one calls a pattern primitive if it is 
llst~primitive, but a pattern form if it is primitive in the other 
senses and not in the list sense. They are built from primitive 
patterns with the help of identifiers which prevent their dissection by 
the list analysis mechanism. 

The second principle is the principle of substitution, which allows 
us to define a single symbol as the representative of another pattern 
of arbitrary complexity which has already been formed and which is 
subsequently referred to by that none in place of the full pattern. This 
is a particularly valuable technique, not only for allowing synonymns 
for otherwise cumbersome expressions, but for the possibility it allows 
of defining patterns recursively. In order to produce a finite recursion 
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the pattern must sotoewhere contain an alternative between a terminal 
pattern and a repetitive pattern. Subject to the exercise of this 
precaution, Che introduction of a recursive pattern may be a very 
concise way to define a desired pattern. 

The third principle is the boolean composition of patterns. Since 
patterns are not predicates , they cannot be composed on the level of 
patterns but rather on the level "has the proper record been made to 
indicate a match?' 1 Such a question not only transforms a pattern into 
a predicate, but tacitly recognizes the fact that the primitive patterns 
are not only recognizers, but are capable of recording data as well. In 
this way it is possible to check whether the multiple occurrence of a 
symbol reflects a aatch always to the same subexpression, as well as 
making the matched expression available for later substitution into a 
skeleton. 

Our description of REPLACE then consists of a categorical enumeration 
of the primitive patterns and pattern forms which it recognizes. 

First among these are the constants which represent expressions. 
These include 

== The universal pattern which catches any expression. 

=ATO= a pattern which will match any a ton. 

*NUM' a pattern which will natch any nuccral. 

-ORD- a pattern which will natch any ordered set. 

The actual list depend** upon the COMVKRT processor under discussion, 
and could be extended considerably depending upon the circumstances. For 
example if there were different kinds of numbers to be distinguished, 



such as. floating point or integer, one would wish to introduce 

distinctive primitive patterns. This idea could be followed to n»re 

elaborate data types, such as arrays. The characters blank, left 

parenthesis and right parenthesis arc delimiters and cannot be used 

directly as characters in the language, so that one might wish to 

introduce pattern synonymns for thea, such as -BLA*, =LPR= and -RPR-. 

Were raw input in the form of BCD character strings available to the 

processor, it would need the ability to recognise delimiters as 

well as other characters. 

The next category of primitive patterns comprises the variables. 

These arc indicated in each individual CONVERT program by means of 

its arguments H and I, and eventually appear in the dictionary which 

is the argument L of RESEMBLE, h is a periodic list of period 3, 

in which we find the repeated cycle (... variable mode value...). 

Each variable is a primitive pattern, but the way in which it matches E 

is governed by its mode and almost always requires a parameter which 

is its value. Thus rather than describe the variables, which depend 

upon the program, we describe their possible types by listing the various 

possible nodes. 

X VAR C thd variable mode, in which the letter X is used to 

represent an expression, however complicated. X will 
match E if the LISP function (EQUAL G E) is true, but 
produces no change in the dictionary L, The variable 
mode may be used to introduce a synonymn for the atom 
X, to fix a constant in the entire CONVERT program, or 
to avoid repeatedly writing a lengthy expression. 
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X UAR * the undefined variable mode. X will match anything, 

but the entry in L Is changed to read --.X VAR E ... , 
so that as a consequence If X appears as part of a 
more complicated pattern It will have to represent the 
same quantity each time it occurs, and moreover this 
common value will be preserved for subsequent use by 
REPLACE. The value associated with a variable in the 
UAR mode is of no importance, but some value must be 
specified to preserve the periodicity of L. The 
programmer generally need not concern himself with this 
point since UAR variables are generally listed separately 
as the argument I of CONVERT, the others with their 
full mode declarations comprising the argument M* 

X PAT P the pattern mode, in which the letter X represents an 
entire pattern, however complicated* This pattern is 
the value of the variable, and one computes 
(RESEMBLE P L E) in place of (RESEMBLE XU). 

X PAV P the pattern variable mode, which is a combination of the 

modes PAT and OAR. Not only mist the pattern P match 
E, but the dictionary L is altered to read ... X VAR K ,,, 
so that the expression E which matched X wilJ not only 
be available to REPLACE, but so that if X occurs 
several times it will always match the same expression E. 

X BUV (P . ..) the bucket variable code. It is similar to the mode 
PAV, but rather than requiring that the same expression 
E natch every occurrence of the pattern variable, we 
siaply make a list of these expressions. Thus X in the 
BUV Eiode will match any expression, but the dictionary 

is modified to read ... X BUV (p E .,.) ... . 

■ 

X CUV (P K) the counting variable mode. It is similar to BUV mode, 
but rather than listing the matching expressions we simply 
count them. Thus L is modified to read ... X CUV (P K+1) 

X STL K the strictly less mode. X will match any number, which 

is strictly less than X, which must be initially specified 
as a number* not as another variable. 

X STC K the strictly greater node. It is the same as the mode 

STL, but with the sign of the inequality reversed so 
that X matches any number strictly greater than the 
number K. 

X RUL R the rule mode. R is a rule set defining the conditions 
under which X will match with E. This mode allows the 
user of CONVERT to define new modes or types of variables. 
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These are the principal modes which exist for matching variables 
to expressions. The list could be extended or modified. A diversity 
of number types or data types could require the introduction of 
Additional arithmetic comparison modes, or modes for lexicographic 
ordering. Logically some of these might be Introduced through the 
rule mode, but their separate introduction would represent a great 
convenience. 

One type of mode with which we have experimented, but not Included 
in the above list is the subset mode X SUB P t which will match an 
expression and produce as a value a list of those of its elements vhich 
match the pattern P. Should X be repeated several times in a larger 
pattern, the same list of extracted elements must result, although we 
will permit them to appear in a different order but not with different 
multiplicities, A variation of this idea, X HSU P, the maximal subset 
mode, resulted in a list of all those elements of the matched lists 
vhich appeared at each occurrence of the symbol X. 

Whenever we encounter an atom in a pattern which does not appear 
in the dictionary L t that atom will match only itself. Thus all atoms 
are primitive patterns; those which appear on the dictionary I. are 
to be matched as their mode specifies, while all others match only 
themselves, except for those such as «ATO- which have fixed meanings In 
all CONVERT programs. 
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An empty list may watch only Itself. 

There is a series of primitive pattern forms which match expressions. 

(■QUO* P) a quoted pattern. The expression P must be equal to E. 
This pattern form has two uses. It allows us to use 
the nases of patterns or pattern forms as patterns. For 
Instance if vc wish to match the atom =QUO= we would 
write ("QUO' =QU0*O, Also, it allows us to quote 
expressions which may then be compared by the simpler 
LISP function EQtfAL rather than RESEMBLE. 

(»DEF= 81 PI N2 P2 ... Q) this pattern form allows us to define 
patterns. It is thus a local version of the PAT 
mode. Within the pattern Q, if the symbols Nl, K2, etc. 
arc used, they represent the patterns PI, P2, etc. The 

pattern Q is distinct from PI, ?2 if =DEF= 

contains an odd number of arguments, but if there is an 
even number, it is simply the last pattern in the list. 
=DEF= permits the recursive definition of patterns. 
For example, (-DEF- H {-OR- -ATO- (B B))) Is the 
definition of a binary tree, or (-UEF- (E) ((*UR« () 
(*» « E)))) is the definition of a list of even length- 

A recursive pattern definition will generally 
involve a choice between a terminal pattern and a 
repetitive pattern which choice Is permitted by the 
pattern form »0R* or the form *0R ft which refers to 
fragaents* Khen fragments are defined, their names 
are enclosed in parentheses. Since the patterns 
within the OR pattern forms are examined in order, the 
terminal patterns must be listed first. 

(=AND= PI P2 ... Pn) all the patterns PI, P2, ... Pn mist natch 
E. If more than one of the patterns contains the same 
variable. It must match the same subexpression according 
to each of the patterns. Since the leftmost fragment 
variable 5s given precedence to match the smallest 
fragment vhen several fragment variables occur, this 
should be borne in mind in arranging the arguments of 
»AV|>», since Its arguments are also examined in order, 
from left to right. 
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(-0R* PI P2 ... Pn) at least one of the patterns PI, P2, ... Pn 
must match E. They are, considered in order, from left to 
right, and once a match is obtained, none of the others will 
be considered. The OR pattern forms are the ccans by which 
patterns nay be defined recursively. If P* fails to aatch E, 
all variables vhich may have been tentatively defined by 
means of P± are forgotten when P«j.i is considered, and only 
the variables which might have been bound by the successful 
pattern appear in the value of RESEMBLE. 

(-NOT- P) the pattern P must not match E. If it does not match, 
it cannot bind any variables, while if it does match, the 
variables it binds do not matter because the overall match 
fails. 

The next group of pattern forms which we have to discuss match 
fragments* and therefore cannot match atons> and must always appear as 
a part of a larger pattern. There is only one constant fragment pattern 

™= the indefinite fragment which will Batch any fragment. 

For example, ( es -) will match any list, (f" a= ") will 
match any list having &t least two elements, and so on. 

We may also specify fragment variables. This is done in the argument* 
M or I of CONVERT by enclosing the variable narce in parentheses. This 
usage of parentheses Is a linguistic device motivated by the fact that 
one needs to have some kind of delimiter to indicate a fragment and 
the only delimiters readily available in LISP are parentheses- If we 
enclose both the name and specification of a fragment in parentheses we 
figure that we are more or less even, and moreover have a notation 
nore or less compatible with LISP. It is important to bear In mind 
that fragments do not exist independently of a larger expression which 
contains them. 
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The way in which a fragment variable Is to be regarded as 

representing a fragment of Its pattern and thus to be eventually matched 

with a fragment of some expression Is also specified by a mode 

declaration, which is accompanied by a value which gives necessary 

parametric information to make the match. The possible modes arc 

the following. 

(... (XXX) VAR EE ...) the variable mode. EE is an expression 

which is to be taken as a fragment of E. Thus (XXX ...) 
requires that the first element of EE match (CAR E), 
the second match (CADR E) and so on, until finally 
whatever portion of X follows XXX must match the 
resulting (CD...DR E) . The elements of EE must match 
the corresponding portion of E by equality, not by 
RESEMBLE- For example, with (XXX) VAR (0 1), (XXX 2) 
will match the list (0 1 2) but not the list ((0 1) 2). 

(XXX) UAR EE the undefined variable mode. In general the value 
EE is the empty list {) . The fragment XXX will match 
with whatever fragment which is larger than EE, 

However, if XXX appears several times In a larger 
pattern it must always match with the same fragment. If 
upon seeing a subsequent occurrence of the fragment XXX 
ve find that the second fragment needed does not correspond 
to the fragment first found, we »ay revise our original 
estimate by trying a larger fragment for the initial 
occurrence of XXX. Hence we always take EE as our 
estioate of the fragment, usually the null fragment, and 
try successively larger fragments by adding elements 
to the end of EE until If possible a satisfactory 
candidate is found* 

(XXX) PAT P the pattern fragment mode. The fragment chosen to 

natch the fragment XXX must match the pattern P as well. 

(XXX) PAV P the fragment pattern variable node. Not only must 
the chosen fragment satisfy the pattern P, but for 
repeated occurrences of XXX the same pattern must appear. 
The final dictionary will contain the common fragment. 
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(XXX) BUV P The fragment bucket variable mode. It differs from the 
the PAV mode in the respect that any suitable fragment 
satisfying the pattern P will be accepted* and that the 
dictionary L will eventually contain a list of the 
fragnents which matched each instance of the pattern XXX, 

(XXX) CUV P the fragment count variable mode- It differs from the 
Bode BUV simply in the respect that the instances of XXX 
are simply counted. The principal use of such a mode is in 
recursively defined patterns for which this number is not 
known in advance and we wish to determine it. 

(XXX) REP (P K) the repeat mode. We expect that XXX is a fragment 
in which the pattern P is repeated K times. For example, 
a list of ten elements could bo defined by (XXX) in which 
we define (XXX) REP (« 10). 

Our remaining pattern forms consist of quoted and boolean combinations 

of fragments. They are always part of a larger expression. 

(*QUO* P) a quoted fragment. P is a list, which is suposcd to appear 
as a fragment of a matching expression; again it must match 
through equality. 

(SAND? PI P2 ,,. Pn) All the patterns PI, P2, ..., Pn must be fragments 
of a larger pattern P. However, they must all match 
exactly the same fragment of E. For example, 
(«o (SAKDS (X «) <— Y)) — -) will match (A (X Y) B) but 
not ((X A) (Y B)), whereas <—- <*A**D* (X «) (™ Y)) — *) 
will match both, 

(*AND* PI P2 . .. Pn) When this combination appears as a fragment of 
a larger pattern P, we 'must regard PI, P2, ... Pn in turn 
as fragments of P occupying the same position as (*AND* ... ), 
and all these patterns must match E* However the same fragment 
of E which corresponds to PI need not correspond to P2, and 
so on. For example, ((*AND* (X ™=) (=== Y =«)> X), 
when X is in UAR mode, matches any list which begins and 
ends with the same clement and which contains at least one 
Y» not the last clement. 

(*OR* PI P2 «,« Pn) At least one of the fragments PI P2 ... Pn, when 
used in place of (*0R* .,.) must result in a pattern which 
matches £. For example, (-DEF- (E) ((*0R* () (« — E))> ) 
matches a list of even lenght; (X (*0R* (* *) {X X X)) X) will 
match a list of four elements of which the first and last are 
equal when X is in UAR mode and the middle two are stars, or 
else a list of five identical elements. 
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(*NOT* P) When P Is Inserted as part of the larger expression 
contalnint (*NOT* P) p the resulting expression must not 
match E. Thus, <(*NOT* (1 2 3)) 4 5) will natch 
(32145) but not (1 2 3 4 5). 

With this we have completed our survey of the prialtivc patterns 
and pattern forms. Once a catch Is obtained to a given pattern, a 
dictionary is produced which we may then- use to make substitutions 
into a given skeleton. 

Accordingly we now Investigate the various possibilities for 
skeletons, 

THE SUBSTITUTION FUNCTION (REPLACE D S) . 

The arguments of the function REPLACE are a dictionary 
D and a skeleton S. The dictionary Is an edited version 
of the value of RESEMBLE when a successful match Is 
obtained . The skeleton Is an expression In which these 
variables are to be * replaced by their equivalents which 
they matched In the original expression E, For example^ 
if the pattern (cos x cos y - sin x sin y) were to be matched to 
the expression (cos 23* cos 19* - sin 23* cos 19*) in which x 
and y were declared as variables, we would perhaps like to 
substitute into the skeleton cos(x+y) to obtain cos(23* + 19°), or perhaps 
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even better, cos i2°. 

In addition to making simple substitutions, we would like to 
make calculations as this example indicates, as well as permitting 
the generation of still other skeletons. Moreover, It Is a very 
important aspect of the CONVERT language that we arc able to permit 
intermediate results to be formed and analyzed further. Thus a 
portion of REPLACE is devoted to some of the control functions of 
CONVERT, 

For these reasons we may also enumerate several categories 
of skeletons and skeleton forms- Again, by a skeleton form, we mean 
a pattern which becomes a skeleton after appropriate substitution 
of its constituents. One of these categories is conpriscd by the 
atomic skeletons, of which there are constants and variables. The 
constants are generally synonymns for quantities which cannot be 
written directly, such as delimiting parentheses. The variables take 
their meaning from the dictionary D, and depend upon the particular 
CONVERT program under consideration. There is then a category of 
skeleton forms which are comprised of functions and their arguments, 
which permits us to use a LISP-like functional notation within a 
skeleton. Another category allows us to introduce single symbols to 
stand for other quantities, be they expressions or skeletons. Finally 
there are those skeleton forms which allow the formation of intermediate 
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results and to continue their analysis. 

All primitive skeletons and skeleton forms may be also regarded 
as fragments, which means that they are to be inserted into a larger 
skeleton without their delimiting parentheses* 

In order to preserve some systematic notation, we have found 
it convenient to introduce a number of informal conventions, which are 
meaningless to the CONVERT processor, but which greatly Improve the 
legibility of a program and facilitate its interpretation. For 
example, we choose three letter combinations as the names of the 
primitive patterns and pattern forms (with the exception of the two- 
lettered OR and --)! while ve choose uniformly four letter combinations 
for primitive skeletons and skeleton forms. This choice Is also 
reflected in our choice of node names, which are again of three 
letters for patterns and four letters for skeletons. Dealing with 
patterns, wc have found it useful to use single characters as atoms, 
but triple characters such as XXX for fragments, unless some more 
mnemonic combination presents itself in a particular case. Agnln, 
it is useful to distinguish fragments from expressions, and for this 
reason wc surround the names with equal signs, as -QUO-, when they 
refer to expressions, or with stars, as *AND ft , when they refer to 
fragments. 

Considering the possible skeletons in detail, we commence with 
the constant skeletons. 
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-TRUE- a skeleton used by the RUL oode, which causes , 

RESEMBLE to continue with whatever dictionary 
exists when it Is encountered. 

-FAIL- the corresponding skeleton which produces a false 

answer in the execution of the RUL mode. 

-SAME- A skeleton is generally part of a CONVERT rule, 

whose pattern is being compared to some expression E. 
=SAME* refers to this expression, whatever it was. 

=LPAR= a synonym skeleton, which stands for a left parenthesis. 

=RPAR- synonym for a right parenthesis 

=BLKK= synonym for a blank. 

As in the case of the atomic patterns, this list could be 
presumably considerably extended should the particular application 
warrant. For instance, a skeleton such as »READ= could be used to 
obtain one expression frora sorae input apparatus, and its value would 
be the resulting expression. We see in the present lisr two types 
of skeletons — synonyms for delimiters which cannot be quoted, and 
referents to objects which exist within a CONVERT program. 

For the variable atomic skeletons, we have only two modes. 



X EXPR S the expression mode. Every quantitiy which was in 
the VAR mode, or was changed into the VAR mode, 
such as those previously in the UAR or PAV nodes, 
is transferred to the EXPR mode when the value of 
RESEMBLE is edited to become the dictionary of 
REPLACE. In addition, quantities may be specified 
initially in the argument M of CONVERT to be in the 
EXPR mode. When X, in the EXPR mode, is encountered, 
it is Substituted by its value, S, without any 
modification. 

X SKEL S the skeleton mode. The single atomic sybol X stands 
for the entire skeleton S t which is placed Instead 
of X, and then modified (replaced) using again the 
function REPLACE. 
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Any* atom which is not listed in the dictionary D in one of these 

two modes ^nd is not fin atomic constant —such as -SAME- — is taken 

to be itself, in other words it is copied without change. 

For example, if the dictionary D is {A EXPR (M K) K SKEL (G A)) 
and the skeleton S is (A A () (1 H) *), 
then (REPLACE D S) is <(M N) <H N) {) <1 B (M U)) *) ; 
with S x = (C A S (I) CAS), (REPLACE D S|) will be 

(C (M H) (G (M N» (I) C (M N) S) 
Note that instead of A [exprl we put its value (M K) exactly as it 
appears in the dictionary, whereas N [sfcel] is substituted by (G A) 
and then we REPLACE this, to obtain (G (H N)) . 

Among the skeleton forms, we have a large number vhich are in 

reality functions specified in the LISP prefix form, (F Al A2 ... An), 

in which P is the naae of the function and Al, A2, ... An are its 

arguments, lrt every case, all the atgumeiits of such a function a« 

replaced before it is executed. 

(=PRNT= S) prints its argument, which is its value. 

(=RAKD= SI S2) makes a random choice, probability 1/2, between 
SI and S2, the chosen argument then being replaced. 

(*COMP« A B) treats A and B as sets and calculates their relative 
complement A - B, consisting of those elements of 
A not in R. Repeated elements of A appear in the 
complements with the same multiplicity, if they appear 
at all. 

(*IHTS- SSS) forms the intersection of the skeletons comprising 

the fragment SSS, treating them as sets. If an element 
is repeated n tir.es in the intersection, it occurred 
at least n times in each argument, and no more than n 
times in at least one of them« SSS should not be empty. 

(«UNO!C» SSS) forms the union of the skeletons forming the argument 
list SSS* However, no element appears more than once 
in the union. 

(-C0NO SSS) concatlnates the argument skeletons. It is an alternative 
to ■IWOX 1 - when one wishes to preserve multiplicity and 
order. 
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(=CART= SSS) forms the cartesian product of the argument 
skeletons* once they are replaced, 

(-PLUS* SSS) sums the argument skeletons after replacing Chen. 

(•MINS- SI S2) computes the difference SI - S2. 

(-TIKS- SSS) multiplies its replaced arguments. 

(=DIVD* SI S2) forms the integer part of the quotient sl /$2< 

(-REMH- SI S2) forms the remainder of SI after division by S2. 

(-IKCR- S) adds 1 to its argument. 

(»DECR- S) substracts 1 from S; that is, its value is S - 1. 

(-ARRY" INS) forms an array of dimension N whose Tth element is 
computed according to the skeleton S in vhich I may 
appear as a variable. As is true of all these function 
skeletons, I, N, S are first replaced before any of this 
construction is attempted. If N is a list and not a number, 
the dimension of the array is the lenglit of K p whose 
elements depend on the corresponding list elements. 
If S is missing, zeros fill the array. 

(=EJJTR= SI S2 S3) stores SI as the S2nd element of the array S3. 
SI, S2 and S3 are first replaced. 

(=EXTR= SI S2) produces the Slst element of the array S2. 

All these function skeletons follow the LISP convention that their 

arguments are to be evaluated first > in this case by the function 

REPLACE which treats thea also as skeletons, before the function is 

to be executed. If Di is (T EXPR 2 U SKEL (=PLUS= T 3)) 

Si = (T U) will be transformed to (2 5); 

S2 - (<B> (=DECR» (»TIKS« T U U)> <U}) will become ((B) 49 (5))- 

When D = (H SKEL (ABC) A EXPR AA N SKEL (=C0XC= (3 A) (5 6)) ) 

and S Is (Y (=UHON* NS(13 3)) (S (-INTS- M N> S) 2) 

then (REPLACE D S) will be (Y (3 U 5 6 AA B C 1 ) (S () S) 2) 

Note that the intersection has the value () in this example, and the 

union supresses repeated elements. 
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It la clear Chat cvin^ of thoto functions arc primitive, la ;ji> ccti&c 

that they could be written in no other way, tdillo tho other* arc tcnpoaiK. 

HowovDr, rhoy havo boon lftt*nd amci^s thfi prfm'it ivo cVolctonc ae 4 

aiatter of convenience. 

Another skeleton fom clcscly related to this idea contains -ITER*. 

(-HSR- II Nl 12 N2 . .. S) In this skeleton form, li are 

variables which serve as indices vhlle Ni are their 

corresponding ranges. For each combination of 

possible Values the skeleton S, uMeh nay contain 

then as variables^ is evaluated, and a Use made of 

the resulting values. N2 may depend upon 11 f $0 that 

this skeleton fern Is equivalent to 

(-ITER- II PI <*I1ER* 12 N2 <*1TER* ... (*1TER* In Kn 5) ..#)))h 

When the rau^t SSI Is a numeral, li takes Oie values 

1, 2, 3, ..., Ni inclusive, while if Ni is a set, 11 

takes ouccceoivcly the value* (CAR Nl), (CADR Ki) , ... 

For oxonplo, ve could writ^ {-CA&T- SI 5£) in the 

*lt*rnntivo fom (-ITPR- II SI 12 S2 (II I2>) . If c 

variable ie enc-lceed in par^th^eoe, as (li) , it ic 

taken 4* representing a fragment. 

The next group of three primitive skeletoa forms alLovs us to 

introduce tenporary definitions cf skeletons or expressions* They 

accomplish locally exactly the sfzne thing which is done by the nodes 

BXPR and SKEL globally. Again It Is clear that they have considerable 

utility in allowing us to nake recursive definitions of skeletons. 

(-JU0T- PI SI 82 52 ... S) ta&kca & replacement of the *lclcton : % 
but after JuMclnins Lu the dictionary 3 the luforiu^tlon 
N2 IXPR 51 H2 EXPR S2 ..., ;ia which $1 , S2, etc., 
arc net replaced. Rather, they ara effectively quoted. 
A variation of thio skeleton fom lo (*QUOr« SJ , 
in vh^cTt $ it cGpie4 without rapl Aroint t £0 that In 
this narner primitive skeleton name ft and fom ranes 
may be referred to as themselves. 
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(=EXPR= NJ SI N2 S2 ... S) makes a replacement of the skeleton S> 

after adjoining to D the information Kl EXPR SI 1 82 EXPR S2 ' . ,. 
where S] ' t S2 ' , .,, denote the values of these skeletons 
after replacement. 

(-SKEL- Kl SI K2 S2 ,.. S) makes a replacement of the skeleton S but 

after adjoining to D the information Kl SKEL SI' K2 SKEL S2 1 ,., 
vhere by SI ' , S2 \ etc., we mean the value of Sl f S2, ... 
after replacement , 

For both the skeleton form (-EXPR- ,,.) and (=SKEL= ...) the evalua- 
tion of the arguments Sl t S2> ,., is carried out independently, so that 
in evaluating S2, Kl does not yet stand for the evaluated SI, and thus 
will retain Its previous meaning. 

For example, if ue use the dictionary (A EXPR AA B EXPR BB) to replace 
the skeleton (ABC (=EXPR= A (K A) C ((A B C)> (1 A B C 2)) ABC), we 
will obtain (AA BB C (1 (K AA) BB ((AA BB C)) 2) AA BB C) f where we note 
that only inside (1 A B C 2) A has the value (K AA) and C is ((AA BB C)) , 

There are three skeleton forms containing -C0NT-, *REPT= and »BEGK-, 
which are control skeletons governing the formation and subsequent analysis 
of intermediate results by another rule set, whose description we shall defer 
to the next section which explains the general flow of control. 

With this we exhaust the skeletons and skeleton forms which refer 
to exprcssions > and arrive to those which refer to fragments. In general 
there is a scries of skeleton fragments entilery analogous to the expression 
fragments. Of course there is no function *PLUS* since the value of -PLUS- 
is not a list, but there are functions *IKTS* t *UKOK* and so on. Then, 
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there arc a few skeletons which make sense as fragments for which no 

expression analogue exists, *ASUL* being the prototype example. 

First, we have the analogue of =SAME=. 

*SAME* is the expression which the currente rule sec is 

examining, and presuming that this is a list, is 
inserted into the proper place in our skeleton £s 
a fragment. 

Then, we have the fragment variables, for which there are four 

codes. 

(XXX) EXPR (EEE) The expression (EBB), assumed to be a list, 

is inserted in place of XXX as a fragment, but none 
of its elements are replaced. 

(XXX) SKEL (SSS) The fragment SSS is substituted for the symbol 
XXX in any skeleton in which XXX appears, before , 
replacement is made. 

XXX COST R Tills is essentially the mechanise by which we 
define functions in CONVERT. R is a rule set. If 
our skeleton contains the skeleton form (XXX Al A2 ... 

An), we construct the list resulting from 

replacing <A1 A2 ■ . . An) nnd use *r as a new expression 
E, and evaluate the skeleton form («COKT- E * R), whose 
explanation we have deferred to the next section. 
However we may simply think of XXX as the name of a 
function. The difference between the COKT and REPT 
code is that we use respectively *COKT- or =REPT=, 
whose distinction is in their treatment of free 
variables. CONT preserves previously existing 
variable definitions, RI£PT erases them. If the 
function's name is not enclosed in parentheses, its 
value is an expression, but if it is enclosed in paren- 
theses the value is treated as a fragment. 

XXX REPT R Another mode which permits function definitions, 

which is the same as COKT except that it may not use 
the values of any variables which have become defined 
in the course of the program, with certain exceptions 
covered In the discussion of *COXT=, =REPT= and =SKN= 
In the next sanction. Rather, all such variables are 
restored to their state when the CONVERT program was first 
entered * 
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We now describe those skeleton forms which correspond to 

functions whose value is a fragment rather than en expression. They 

are Identified by a * which forms part of their name* 

(*ANUL* S) After replacement of the skeleton S, nothing is 
done to the main skeleton, which is effectively 
equivalent to inserting an empty fragment in place 
of (*AKUL* S) . Clearly such a skeleton as this 
makes sense only if S is an "operator skeleton'*. 
By this we moan that in the course of replacement 
of S, some permanent or auxiliary changes are 
effected. For example, S might involve »PRNT=, 
and in this way we could write on the output tape 
without retaining a copy of what we have written in 
the expression which we are developing. 

The following are precise fragment analogues of the corresponding 
expression valued skeletons, and consequently need no further explanation. 

*QUOT* *EXPR* *SKEL* *COMP* *INTS* *UKON* *CONC* *CART* *ITER* 

*CONT* *REPT* *BE^N« 

This completes our analysis of the possible skeletons, and we 
proceed to the main function CONVERT, which is the control function 
together with its satellites, for a CONVERT program. 

THE COKTROL FUNCTION (CONVERT M I E R) 

Of the four arguments of CONVERT, the one which essentially 
specifies the program to be executed is R, the collection of rule 
sets. The expression which is to be transformed comprises the 
argument E t while the arguments M and I specify the variables which 
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arc to be used In the program. I, the argument of initially undefined 
variables, consists of a list of those variables which ar£ going to 
belong to the mode UAR, as well as those fragment* which arc initially 
empty and which belong to the mode UAR. The variables arc indicated 
as Individual atoms, while the fragments are Indicated by enclosing the 
atomic name within parentheses. Thus a program which uses the variable 
X and the fragment XXX would have an argument I of the form (X (XXX)). 
The argument M, which contains the mode definitions, may be regarded as 
a sort of annotation to the main program, In which lc is explained 
that various symbols have various special meanings. Again the same 
general system applies; variables which are to represent symbols are 
indicated by atoms, while those which represent fragments are indicated 
by atoms enclosed within parentheses. One further convenience has 
been that If the mode nainc itself is enclosed [n parentheses, one 
assumes that sufficient information is already available in the 
initial argument H to calculate its value using CONVERT, so that the 
value may be given as a skeleton to be evaluated. In this vay much 
tedious rc~cvaluation of constant values may be avoided, with a 
corresponding increase in operating efficiency. 

The argument R, the collection of rule sets, is an alternating 
list; in which the nanes of the rule sets alternate with the sets 
themselves. Each rule set is a list of pattern-skeleton pairs. In 
operation, the first rule of the first sot is taken and its pattern 
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compared to the argucent E. Should it match* the corresponding 

skeleton Is evaluated; but if not the second rule Is examined and so 

on. Whenever one arrives to the end of a rule set without a match, 

the original expression is left unchanged. 
I 

The previous two sections have contained a detailed description 

of the fragments and hoi** their match is effected, as well as of the 

possible skeletons and how substitutions are made Kith them. Thus 

it regains only to discuss the skeletons =CONT=, =REPT=, and =BEGN=, 

as well as their fragment valued analogues *COST*, *REPT** and *BEGN*. 

These skeletons permit the formation of intermediate results and 

direct the continuation of the analysis with respect to a specified 

rule set, so that they are the agencies responsible for the flow of 

control in a CONVERT program. 

There is only one skeleton form (=BEGN- S} > anc it© replacement 
is effected by first replacing the skeleton S, then using this result 
as the argument E, and starting the entire CONVERT program over again 
from the beginning* This implies in particular that all variables 
must be restored to their original status of UAR or whatever other 
mode which they had originally and which might have been altered during 
the course of execution of teh program . 

There «e three skeleton forms, (=C0S7^ S) , (=COKT= S K) , and 
(-COST- S Kl Rl K2 R2 ...). In each of these, the skeleton £ is first 
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replaced, and then taken to be the argm&ent E of CONVERT. However t the 
rule set to be applied depends upon which of the three skeleton forms 
ve are using. For the first, similar to the way wc use =BEGN=» we 
return to the current rule set. In the second, {=COKT= SK), we 
continue with the rule set whose name Is K. In the third form, we 
continue with the rule set Kl which Is defined to be Rl. In addition, 
ve are privileged to Introduce other named rule sets at the same time; 
these are R2 with the name K2, and so on. These names take precedence 
over any similar names used In the argument R, because the original 
list of rule sets is appended to the end of this new list. Names not 
usurped In this way refer to the older list. 

The difference between the skeleton forms involving =COKT= and 
those involving -BEGN- is that in continuing to the new rule set, all 
variables which may have previously arisen arc retained. Thus, in the 
rule (X (=CONT* S)), if X was originally In the mode UAR, it will 
have changed to the mode VAR and eventually to EXPR when S is 
evaluated. However, the second time we apply the rules, X will no 
longer be in the UAR mode, but will retain its identiflcaiton in the 
VAR and EXPR codes. 

The third category of skeleton forms, (=»REPT= S) , («REPT* S K), and 
(=REPT= S Kl Rl K2 R2 . ..) is the precise analogue of =BEGN=, in that 
Its members undertake to forget variable definitions and restore the 
dictionary to Its original state. Otherwise the distinction between 
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these three forms is the same as the corresponding distinction for the 
■COST* forms; they differ in the rule set they choose for the 
continuation. At this point a conflict makes itself apparent* With 
=CONT= we have no real problem because we retain all previously 
defined variables. However, =EEGN= undertakes to restore all variables 
to their original condition. 

The conflict exists because the skeleton form (-EXPR-* XI SI ... S) 
undertakes to guarantee that at all times within the expression S, the 
symbol Nl means SI*, the replaced SI. This Is presumably true no 
matter what rule sets and revisions of variables are encountered on 
accouftt o£ the skeleton faws uMftg -BEGN-, *REPT- ot -C0NT-. A 
similar situation holds with respect to -SKBL" and -QUOT*, and 
presumably also for =ITER*= and »AR-tY»*. On the other hand, "BEGS* provides 
a convenient synonycin whereby we may re-enter our progran recursively, 
and this demands that the program be in the same state every time it is 
re-entered, and that it cannot depend upon a previous history of having 
bound variables. 

The resolution of the dilemma consists simply in establishing 
a hierarchy of precedence. Thus, in decreasing order of precedence, we 
have 

-BEGN- 

-Ql T OT=, *EXPR=, =SKEL=, *=ITEIt=, =ARRY= 

=REPT- 

=CQET= 
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In this way, -BECK-, vMch has only one form since ic is tlie 
recursive re-entry to the program, causes all variables to be restored 
to their original status* The skeletons such as E EXPR* of the second 
level cause their definitions to be retained at all times within their 
argument skeleton S except when =BEGtf= or A BEGS* are encountered. 
Then, =REPT= causes all variables to be restored to their original 
condition when proceeding to a new rule set t except those which have 
been bound by one of the skeletons of the second level. Finally^ 
-COST- retains all variable definitions, whatever their origin. The 
fragment valued skeletons are equivalent to the expression valued 
versions, in this hierarchy. 

In order to illustrate these ideas, we might consider a few very 
simple programs. For example, the LISP function (MERGE L M) merges 
two lists, and is defined by 

(MERGE (LAMBDA (L M) (IF (NULL L) L (COSS {CAR L) (CONS (CAR M) 

(MERGE (CDR L> (CDR M)))))))« 
If L - (0 1 2) aod M - (ABC), (MERGE LM)- (0A1B2C). The same 
function written in CONVERT cakes the fom 

(MERGE (LAMBDA (L M) (CONVERT 

(LIST) 

(QUOTE (P Q (PPP) (<)<&W) 
(LIST L H) 
(QUOTE (* (' 

(UP PPP) (Q QQQ)) (P Q <*BEGN* ((PPP) (QQq))))) 

((() 0) 0) 

))) 

»>. 
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This second definition id a bit more space consuming because 
of the necessity to define the CONVERT program as a LTSP function, 
but its essence is conveyed by the two rules comprising the rule set* 
The argument M is an empty list because there are no node declarations 
to make, while the second argument I indicates that we are going to 
use the variables P and Q and the fragments PPP and QQQ. The third 
argument is a list of L and M, which is our expression to be transformed. 
Effectively, CONVERT functions are functions of one variable, and to 
deal with functions which wo would ordinarily regard as having several 
variables we must combine all these arguments into a single list. 
Finally the argument R, which is a collection of rule sets, contains 
only one rule set, which has been given the noncomnital name *, It 
consists of two rules, the first of which is the terminal condition and 
tells us to quit with an empty list when both the lists L and H are 
empty. Otherwise they are separated into their CAR's and CDK's by the 
second rule, which lists (CAR L), followed by (CAR M) , followed by whatever 
fragment results from recursively applying the same process to the list 
of (CDR L) and (CDR M). Were we co have written =BEGK= in place of 
*BEGK* we would have obtained a list of three elements: (CAR L) , 
(CAR H), and the merged CDR's. 

A second example is the inverse function, UNMERCE, which takes 
a list of even length and separates its odd and even elements into 
two separate lists. Defined in US? it takes the form: 
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. (UNMERGE (LAMBDA (I.) (IT (HULL L) (LIST L L) 

((LAMBDA (X) (LIST (CONS (CAR L) (CAR X)) 

(CONS (CADR L) (CADR X))>> (UNMERGE (CDDR L)))))) 

while in COXVERT we find 

(UNMERGE (LAMBDA (L) (CONVERT 
(LIST) 

(QUOTE (P Q (PPP) (QQq> (RRR))) 
L 
(QUOTE ( *( 

((P Q RRR) (-CONT- (-BEGN- (RRR)) * ( 

(CCPPP) <QQQ)) (<p PPP> (Q QQQ»> 

))) 

CO (00)) 

))) 
)))■ 

Again the four arguments oE CONVERT arc noscly trivial. However, 
we oiay see how the skeleton (-BEGN- (RRR)) corresponds to the 
((LAMBDA (X) ...) (UNMERGE (CDDR L))) portion of the analogous LISP 
function, since we are dealing with a coemon subexpression whieh we wish 
to compute in advance. Since ve are sure of always obtaining a list 
with two sublists, the solitary rule of the skeleton »CONT= simply 
serves to give these sublists names and to identify them as fragments. 
The variables P and Q were defined in the outer pattern, which is 
why "COTTCV rather than *REPT= was used in proceeding to the inner rule set. 

For our final example we choose an elementary but non-trivial 
example, a program to calculate derivatives of algebraic expressions and 
to simplify moderately the resulting expression. 
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(DER1VATTVE (LAMBDA (X E) (COX VERT 

(CONS (QUOTE T) (CONS (QUOTE VAR) (CONS X (QUOTE ( 
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PAV 


*HUX= 


I 


PAV 


-SUM* 


LL 


SKEL 


(UKLS LLL) 


r.K 


SKEL 


(UKLS RRR) 


DL 


SKEL 


(*3EGX= (UNLS LLL>> 


DR 


SKEL 


(=BEGN= (UNLS RRR>) 


UKLS 


B?VT 


«00 X)) 


.4. 


KEPT 


( 
(<K 1.) (-PLUS- K L» 

((X 0) X) 

«0 X> X> 

{(X X) (2 * X)) 

<<X Y> (X + Y)> 
> 
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* *" * 


REPT 






((X 0) X) 






((0 X) (- X)) 






((K L) (=M1KS» K I.)) 






((X X) 0) 






((X (- Y)) (.4. X Y>) 






(((- X) Y) (- (.+ . X Y»> 






(((- X) (- Y)) (.+. Y (- X))) 






((X Y) (.+. X (- Y>>) 
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.*, 


REPT 
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((«- ===> 0) 






(<X 1) X) 






<<1 X) x> 






(<K L> (-TIMS- K L» 






<(X (- Y» (- (X * Y))) 






(((- X> Y) (- (X * Y))) 






(((- X) (- Y)) (X * Y)) 






((X X) (X ** 2)) 






((X (X ** K)) (X ** (-1NCR- K))) 






(((X ** K) X) (X ** (-TRCR- K») 






(((X ** K) (X ** I.)) <X ** (-PLUS- 






((X Y) (X * Y)> 
) 
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-/- 


REPT 






((X 1) X) 
((0 --) 0) 
((X X) 1) 

(((X ** K> (X ** L)) (X *« (-MINS- 
((X (X ** 2>> (1 / X)) 
((X Y) (X / Y>) 
> 



X L))) 



K L))) 
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The actual rule set R is quite modest. After noting that the 
derivative of the variable of differentiation is 1 and that the 
derivative of other atoms is zero, there follow the rules for the four 
algebraic operations, as well as for powers. By arranging the rules in 
just this order the hierarchy of precedence of the algebraic connectives 
is established. For example, multiplication is selected as a connector 
only if there are no sums or differences on the top level. Moreover, 
since the shortest leftmost fragment satisfying a pattern is always 
chosen association to the left is always made. 

The right hand parts of the rules contain rather cryptic notation 
which is the result of a certain srtount of revision of a more straight- 
forward rule set. For example we could write ((LU + RRR) ((-BECN- 
(LLL)) + <-BEGN= (RRR)))). However, when LLL is a fragment containing 
only one element, we would prefer not to write the extra set of 
parentheses* which Is accomplished by the skeleton forta (IWLS XXX) 
defined in the KEPT node. Moreover wc judge DX to be a more suggestive 
name than (»BEGN- (UKLS X)), in accordance with which we introduce the 
appropriate skeleton definitions. 

The fact that the right hand ©ides of the rules use prefix rather 
than infix notation permit© us to define the algebraic connectives as 
functions which effect the obvious simplifications of the crude results. 
For example to differentiate 2*x 2 we obtain as a result 0*x 4- 2*26x l *l, 
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and the resulting superfluous factors of unity or zero sunmands must 
be eliminated, as well as waking adjustments of «cro or unit powers, 
and so on. The alternatives to this mode of operation are to use an 
auxiliary simplification routine which requires a second pass through a 
rather long expression, and for which it is somewhat unclear when to 
terminate a given simplification process or in what order to combine 
:ii-' . Another alternative would be to recognize more complicated 
patterns such as ex for constant c, and prescribe a more appropriate 
derivative which would not need so much simplification. 

The actual rules for the simplification may be as elaborate as 
desired* but those which ve display here suffice to remove the obvious 
redundancies produced In the differentiation, and to clean the result 
up slightly beyond this. For example, the pattern variables K and L, 
which recognize numbers, are employed several tines to avoid indicated 
operations on numbers which could just as well be executed. 

CONCLUSIONS AND EXPERIENCES 

The CONVERT language has been implemented for the IBM 709 computer 
of the Centro Kacional de Calculo (CENAC) of the National Polytechnic 
Institute in Mexico City, as well as for the Q-32 computer of the Systems 
Development Corporation in Santa Monte a, California. However, since it 
is basically written in LISP it is by implication available for computers 
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for which a LIS? system exists, and 8 version is also in operation in 
the IBM 7094 time-sharing systen of Project MAC at HIT. There have 
been several complications which have arisen on account of the LISP 
substrate t and efforts are underlay to provide a direct machine 
language version for various computers. 

The first version was written in MBLISP, a LISP dialect which differs 
from LISP in a number of technical details, but which had a sufficient 
amount of pushdown list and free storage space available to be able to 
execute reasonably complicated examples. However, as an interpreter, 
interpreting a CONVERT Interpreter for a alow machine, it was decidedly 
show. The Q-32 version using a faster machine with a LISP compiler 
gave a very such better performance. One of the prograr.s tested involved 
analyzing a group of order 16, defined in a moderately complicated manner 
as a scai-direct product C8:C2 of cyclic groups of order 8 and 2 
respectively. It was possible to obtain the group table in about 3 
minutes at times when the time-sharing competition was not intense 
which meant about a second per group product, a figure several hundred 
times as fast as for MBLISP in the 709. 

However* the Q-32 LISP has a limited pushdown list available, which 
prevented the execution of quite a number of programs since CONVERT as 
it stands is highly recursive* However, the latest experience has been 
with yet another LISP processor, a compiler, constructed for the CENAC 
by Lowell Hawkinson and Robert Yates, and which they are presumably 
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continuing to develop- It Is unique anvong LISP dialects in having an 
excellent array and floating point numerical capability, as veil as 
being very carefully organized in all its other aspects. As a result 
it is one of our current vehicles for CONVERT programs. 

Our principal applications so far have fallen in two areas. One 
is involved with the Improvement of the CONVERT processor itself. 
The other area has been concerned with group analysis, both for the 
determination of irreducible representations and analyzing the properties 
of .finite groups, and for calculations involving the non-commutative 
generators of Lie algebras. Here CONVERT is applied not only to effect 
the calculations, but again to provide a notation to improve the ease 
vlth which the desired calculation can be Indicated. 

The substance of our experience to date has been that it is indeed 
quite easy to describe programs and adduce novel notations. For a 
certain class of problems whose results are of high importance and 
for which this means of description is especially convenient, the 
application of CONVERT has been quite satisfactory. 

Further extension depends upon a number of factors. Various changes 
in the processor in use at Project MAC, which consist for the most part 
of introducing iterative rather than recursive analysis wherever possible, 
including a CONVERT program feature, have increased Its speed by a 
factor between five and ten. The prospect of a compiler to replace the 
present interpreter without working through LISP as a substrate has 
already been partially realized in the Hawk inson -Yates LISP, which admits 
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a compiled function as a data type. This has permitted the compilation 
of special functions to perform arithmetic operations directly on the 
arrays which we use to represent group elements. This in turn has 
revealed the possibility of introducing whole new hierarchies of 
notation adapted especially for particular problems, as well as to 
alleviate awkward programming situations of a more general nature. 

In suntnary, although we doubt that our system of notation will 
persist unmodified^ the general scheme seems fairly serviceable. 
The transformation rule format, the ability to define patterns and 
operations recursively, the implicit search strategies and construction 
rules, as well as versatility for introducing still other types of 
notation, all combine to produce a very concise programming language. 
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